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SEQUENCE LISTING 
(1) GENERAL INFORMATION 

(i) APPLICANTS: James M. Anderson 

Christina M. Van Itallie 

(ii) TITLE OF INVENTION: Human Occludin, Its Uses 
and Enhancement of Drug Absorption Using Occlu- 
din Inhibitors 

(iii) NUMBER OF SEQUENCES: 6 

(iv) CORRESPONDENCE ADDRESS 

(A) ADDRESSEE: Yale University Medical School 
Section of Pulmonary and Critical Care Medicine 
Department of Internal Medicine 

(B) STREET: 333 Cedar Street, LCI 105 

(C) CITY: New Haven 

(D) STATE: Connecticut 

(E) COUNTRY: United States of America 

(F) ZIP CODE: 065220-8057 

(v) COMPUTER READABLE FORM 

(A) MEDIUM TYPE: 3.5" 1.44 Mb diskette 

(B) COMPUTER: IBM PC 

(C) OPERATING SYSTEM: MS DOS 

(D) SOFTWARE: Word Processing 

(vi) CURRENT APPLICATION DATA 

(A) APPLICATION NUMBER: 

(B) FILING DATE: June 25, 2001 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: U.S. 09/142,732 

(B) FILING DATE: September 15, 1998 

(C) CLASSIFICATION: 53 0-350 . 000 

(viii) ATTORNEY INFORMATION 

(A) NAME: Mary M. Krinsky 

(B) REGISTRATION NO. : 32423 

(C) REFERENCE/DOCKET NUMBER: OCR-754.CIP 

(ix) TELECOMMUNICATION INFORMATION 



(A) TELEPHONE NUMBER: 203-773-9544 

(B) TELEFAX NUMBER: 203-773-1183 
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(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2312 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: cDNA 



(v) FRAGMENT TYPE: complete sequence 



(ix) 


FEATURE : 












(A) NAME/KEY: human 


occludin 






(xi) 


SEQUENCE 


DESCRIPTION: SEQ ID NO: 1: 




GCCTCTCTCC 


ATCAGACACC 


CCAAGGTTCC 


ATCCGAAGCA 


GGCGGAGCAC 


50 


CGAACGCACC 


CCGGGGTGGT 


CAGGGACCCC 


CATCCGTGCT 


GCCCCCTAGG 


100 


AGCCCGCGCC 


TCTCCTCTGC 


GCCCCGCCTC 


TCGGGCCGCA 


ACATCGCGCG 


150 


GTTCCTTTAA 


CAGCGCGCTG 


GCAGGGTGTG 


GGAAGCAGGA 


CCGCGTCCTC 


200 


CCGCCCCCTC 


CCATCCGAGT 


TTCAGGTGAA 


TTGGTCACCG 


AGGGAGGAGG 


250 


CCGACACACC 


ACACCTACAC 


TCCCGCGTCC 


ACCTCTCCCT 


CCCTGCTTCC 


300 


TCTTGGCGGA 


GGCGGCAGGA 


ACCGAGAGCC 


AGGTCCAGAG 


CGCCGAGGAG 


350 


CCGGTCTAGG 


ACGCAGCAGA 


TTGGTTTATC 


TTGGAAGCTA 


AAGGGCATTG 


400 


CTCATCCTGA 


AGATCAGCTG 


ACCATTGACA 


ATCAGCCATG 


TCATCCAGGC 


450 


CTCTTGAAAG 


TCCACCTCCT 


TACAGGCCTG 


ATGAATTCAA 


ACCGAATCAT 


500 


TATGCACCAA 


GCAATGACAT 


ATATGGTGGA 


GAGATGCATG 


TTCGACCAAT 


550 


GCTCTCTCAG 


CCAGCCTACT 


CTTTTTACCC 


AGAAGATGAA 


ATTCTTCACT 


600 


TCTACAAATG 


GACCTCTCCT 


CCAGGAGTGA 


TTCGGATCCT 


GTCTATGCTC 


650 


ATTATTGTGA 


TGTGCATTGC 


CATCTTTGCC 


TGTGTGGCCT 


CCACGCTTGC 


700 


CTGGGACAGA 


GGCTATGGAA 


CTTCCCTTTT 


AGGAGGTAGT 


GTAGGCTACC 


750 


CTTATGGAGG 


AAGTGGCTTT 


GGTAGCTACG 


GAAGTGGCTA 


TGGCTATGGC 


800 


TATGGTTATG 


GCTATGGCTA 


CGGAGGCTAT 


ACAGACCCAA 


GAGCAGCAAA 


850 


GGGCTTCATG 


TTGGCCATGG 


CTGCCTTTTG 


TTTCATTGCC 


GCGTTGGTGA 


900 


TCTTTGTTAC 


CAGTGTTATA 


AGAl'CTGAAA 


TGTCCAGAAC 


AAGAAGATAC 


950 


TACTTAAGTG 


TGATAATAGT 


GAGTGCTATC 


CTGGGCATCA 


TGGTGTTTAT 


1000 


TGCCACAATT 


GTCTATATAA 


TGGGAGTGAA 


CCCAACTGCT 


CAGTCTTCTG 


1050 


GATCTCTATA 


TGGTTCACAA 


ATATATGCCC 


TCTGCAACCA 


ATTTTATACA 


1100 


CCTGCAGCTA 


CTGGACTCTA 


CGTGGATCAG 


TATTTGTATC 


ACTACTGTGT 


1150 


TGTGGATCCC 


CAGGAGGCCA 


TTGCCATTGT 


ACTGGGGTTC 


ATGATTATTG 


1200 



TGGCTTTTGC 


TTTAATAATT 
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TTCTTTGCTG 


TGAAAACTCG 


AAGAAAGATG 


1250 


GACAGGTATG 


ACAAGTCCAA 


TATTTTGTGG 


GACAAGGAAC 


ACATTTATGA 


1300 


TGAGCAGCCC 


CCCAATGTCG 


AGGAGTGGGT 


TAAAAATGTG 


TCTGCAGGCA 


1350 


CACAGGACGT 


GCCTTCACCC 


CCATCTGACT 


ATGTGGAAAG 


AGTTGACAGT 


1400 


CCCATGGCAT 


ACTCTTCCAA 


TGGCAAAGTG 


AATGACAAGC 


GGTTTTATCC 


1450 


AGAGTCTTCC 


TATAAATCCA 


CGCCGGTTCC 


TGAAGTGGTT 


CAGGAGCTTC 


1500 


CATTAACTTC 


GCCTGTGGAT 


GACTTCAGGC 


AGCCTCGTTA 


CAGCAGCGGT 


1550 


GGTAACTTTG 


AGACACCTTC 


AAAAAGAGCA 


CCTGCAAAGG 


GAAGAGCAGG 


1600 


AAGGTCAAAG 


AGAACAGAGC 


AAGATCACTA 


TGAGACAGAC 


TACACAACTG 


1650 


GCGGCGAGTC 


CTGTGATGAG 


CTGGAGGAGG 


ACTGGATCAG 


GGAATATCCA 


1700 


CCTATCACTT 


CAGATCAACA 


AAGACAACTG 


TACAAGAGGA 


ATTTTGACAC 


1750 


TGGCCTACAG 


GAATACAAGA 


GCTTACAATC 


AGAACTTGAT 


GAGATCAATA 


1800 


AAGAACTCTC 


CCGTTTGGAT 


AAAGAATTGG 


ATGACTATAG 


AGAAGAAAGT 


1850 


GAAGAGTACA 


TGGCTGCTGC 


TGATGAATAC 


AATAGACTGA 


AGCAAGTGAA 


1900 


GGGATCTGCA 


GATTACAAAA 


GTAAGAAGAA 


TCATTGCAAG 


CAGTTAAAGA 


1950 


GCAAATTGTC 


ACACATCAAG 


AAGATGGTTG 


GAGACTATGA 


TAGACAGAAA 


2000 


ACATAGAAGG 


CTGATGCCAA 


GTTGTTTGAG 


AAATTAAGTA 


TCTGACATCT 


2050 


CTGCAATCTT 


CTCAGAAGGC 


AAATGACTTT 


GGACCATAAC 


CCCGGAAGCC 


2100 


AAACCTCTGT 


GAGCATCACA 


AAGTTTTGGG 


TTGCTTTAAC 


ATCATCAGTA 


2150 


TTGAAGCATT 


TTATAAATCG 


CTTTTGATAA 


TCAACTGGGC 


TGAACAACTC 


2200 


CAATTAAGGA 


TTTTATGCTT 


TAAACATTGG 


TTCTTGTATT 


AAGAATGAAA 


2250 


TACTGTTTGA 


GGTTTTTAAG 


CCTTAAAGGA 


AGGTTCTGGT 


GTGAACTAAA 


2300 


CTTTCACACC 


CC 








2312 



(3) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: polypeptide 
(v) FRAGMENT TYPE: complete sequence 
(ix) FEATURE: 

(A) NAME/KEY: human occludin 




(xi) 
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SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met Ser Ser Arg Pro Leu Glu Ser Pro Pro Pro Tyr Arg Pro Asp 

5 10 15 

Glu Phe Lys Pro Asn His Tyr Ala Pro Ser Asn Asp lie Tyr Gly 

20 25 30 

Gly Glu Met His Val Arg Pro Met Leu Ser Gin Pro Ala Tyr Ser 

35 40 45 

Phe Tyr Pro Glu Asp Glu lie Leu His Phe Tyr Lys Trp Thr Ser 

50 55 60 

Pro Pro Gly Val lie Arg lie Leu Ser Met Leu He He Val Met 

65 70 75 

Cys He Ala He Phe Ala Cys Val Ala Ser Thr Leu Ala Trp Asp 

80 85 90 

Arg Gly Tyr Gly Thr Ser Leu Leu Gly Gly Ser Val Gly Tyr Pro 

95 100 105 

Tyr Gly Gly Ser Gly Phe Gly Ser Tyr Gly Ser Gly Tyr Gly Tyr 

110 115 120 

Gly Tyr Gly Tyr Gly Tyr Gly Tyr Gly Gly Tyr Thr Asp Pro Arg 

125 130 135 

Ala Ala Lys Gly Phe Met Leu Ala Met Ala Ala Phe Cys Phe He 

140 145 150 

Ala Ala Leu Val He Phe Val Thr Ser Val He Arg Ser Glu Met 

155 160 165 

Ser Arg Thr Arg Arg Tyr Tyr Leu Ser Val He He Val Ser Ala 

170 175 180 

He Leu Gly He Met Val Phe He Ala Thr He Val Tyr He Met 

185 190 195 

Gly Val Asn Pro Thr Ala Gin Ser Ser Gly Ser Leu Tyr Gly Ser 



Gly Leu Tyr Val Asp Gin Tyr Leu Tyr His Tyr Cys Val Val Asp 

230 235 240 

Pro Gin Glu Ala He Ala He Val Leu Gly Phe Met He He Val 



200 



205 



210 





245 



250 



255 



Ala Phe Ala 
Met Asp Arg 
lie Tyr Asp 
Val Ser Ala 
Val Glu Arg 
Val Asn Asp 
Pro Val Pro 
Asp Asp Phe 
Thr Pro Ser 
Lys Arg Thr 
Gly Glu Ser 
Pro Pro lie 
Phe Asp Thr 
Asp Glu lie 
Asp Tyr Arg 
Tyr Asn Arg 
Lys Lys Asn 
Lys Lys Met 



Leu lie lie 
260 

Tyr Asp Lys 
275 

Glu Gin Pro 
290 

Gly Thr Gin 
305 

Val Asp Ser 
320 

Lys Arg Phe 
335 

Glu Val Val 
350 

Arg Gin Pro 
365 

Lys Arg Ala 
380 

Glu Gin Asp 
395 

Cys Asp Glu 
410 

Thr Ser Asp 
425 

Gly Leu Gin 
440 

Asn Lys Glu 
455 

Glu Glu Ser 
470 

Leu Lys Gin 
485 

His Cys Lys 
500 

Val Gly Asp 
515 
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Phe Phe Ala 
Ser Asn lie 
Pro Asn Val 
Asp Val Pro 
Pro Met Ala 
Tyr Pro Glu 
Gin Glu Leu 
Arg Tyr Ser 
Pro Ala Lys 
His Tyr Glu 
Leu Glu Glu 
Gin Gin Arg 
Glu Tyr Lys 
Leu Ser Arg 
Glu Glu Tyr 
Val Lys Gly 
Gin Leu Lys 
Tyr Asp Arg 



Val Lys Thr 
265 

Leu Trp Asp 
280 

Glu Glu Trp 
295 

Ser Pro Pro 
310 

Tyr Ser Ser 
325 

Ser Ser Tyr 
340 

Pro Leu Thr 
355 

Ser Gly Gly 
370 

Gly Arg Ala 
385 

Thr Asp Tyr 
400 

Asp Trp lie 
415 

Gin Leu Tyr 
430 

Ser Leu Gin 
445 

Leu Asp Lys 
460 

Met Ala Ala 
475 

Ser Ala Asp 
490 

Ser Lys Leu 
505 

Gin Lys Thr 
520 



Arg Arg Lys 
270 

Lys Glu His 
285 

Val Lys Asn 
300 

Ser Asp Tyr 
315 

Asn Gly Lys 
330 

Lys Ser Thr 
345 

Ser Pro Val 
360 

Asn Phe Glu 
375 

Gly Arg Ser 
390 

Thr Thr Gly 
405 

Arg Glu Tyr 
420 

Lys Arg Asn 
435 

Ser Glu Leu 
450 

Glu Leu Asp 
465 

Ala Asp Glu 
480 

Tyr Lys Ser 
495 

Ser His lie 
510 
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(4) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(v) FRAGMENT TYPE: synthetic peptide 

(ix) FEATURE: 

(D) OTHER INFORMATION: construct used in experi- 
ments 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Cys Asp Arg Gly Tyr Gly Thr Ser Leu Leu Gly Gly Ser Val Gly 

5 10 15 

Tyr Pro Tyr Gly Gly Ser Gly Phe Gly 

20 



(5) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(v) FRAGMENT TYPE: synthetic peptide 

(ix) FEATURE: 

(D) OTHER INFORMATION: construct used in experi- 
ments 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Cys Ser Tyr Gly Ser Gly Tyr Gly Tyr Gly Tyr Gly Tyr Gly Tyr 

5 10 15 
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Gly Tyr Gly Gly Tyr Thr Asp Pro Arg 

20 



(6) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 
(v) FRAGMENT TYPE: synthetic peptide 
(ix) FEATURE: 

(D) OTHER INFORMATION: construct used in experi- 
ments 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Asn His Tyr Ala Pro Ser Asn Asp lie Tyr Gly Gly Glu Met Val 

5 10 15 

His Arg Pro Met Leu 

20 



(7) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: peptide 

(v) FRAGMENT TYPE: synthetic peptide 

(ix) FEATURE: 

(D) OTHER INFORMATION: construct used in experi- 
ments 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6: 

Ala Ser Gin Gin Val Tyr Arg Lys Asp Pro Cys 

5 10 



